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REPORT  SUMMARY 


Diagnostics  have  been  produced  for  all  Processing  Element 
(PE)  and  Memory  Logic  Unit  (MLU)  printed  circuit  card  types  (a  total 
of  ^0),  except  those  redesigned  types  for  which  logic  prints  have  not 
been  received  from  Burroughs .   Heavy  effort  is  being  expended  towards 
production  of  Control  Unit  (CU)  printed  circuit  cards. 

The  combinational  logic  and  path  test  phases  of  the  off-line 
PE  diagnostics  have  been  completed.  Work  is  proceeding  on  the  control 
logic  tests . 

Generally  speaking,  the  software  effort  is  on  schedule  and 
making  progress.   The  main  perturbation  to  the  schedule  is  the  temporary 
shelving  of  Operating  System  II  (OSIl)  and  using  the  fixed  resources 
from  this  effort  to  consider  interactive  communications  with  the  ARPA 
Network.   Operating  System  I  (OSI)  is  in  the  final  stages  of  coding  and 
the  end  of  the  next  quarter  should  see  much  of  the  ILLIAC  IV  system 
simulated  on  the  B5500. 

ASK  and  GLYPNIR  are  being  consolidated  in  preparation  for 
transference  to  the  Burroughs  B65OO.   The  extended  FORTRAN  design 
(entitled,  COCKROACH)  is  essentially  complete;  plans  are  in  hand  for 
an  extended  tour  to  present  it  to  users. 

Project  expenditures  for  FY70  are  as  follows: 

University  of  Illinois        $2, 197, 6H6 . 65 
Burroughs  Corporation         $6,711,392.00 

At  their  June  meeting,  the  University  of  Illinois  Board  of  Trustees 
approved  the  establishment  of  the  Center  for  Advanced  Computation  as 
the  administrative  unit  for  operating  the  ILLIAC  IV.   The  Center  for 
Advanced  Computation  will  be  organized  with  a  Director's  Office, 
which  will  include  the  Business  Office,  and  three  operating  sections: 
Applications,  Systems  Service,  and  Software. 


The  project  has  entered  into  a  contract  with  Precision 
Instruments  Company  for  the  purchase  of  a  laser  storage  system  which 
will  be  unique  in  concept  and  will  have  the  largest  known  on-line 
storage  capacity  of  any  computing  system  to  date. 
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1.   HARDWARE 

1.1  Diagnostics 

1.1.1  Logic  Simulation 

1.1.1.1  PE  Simulator 

The  PE  Simulator  was    again  used  this  period  for  verification 
of  tests.      Due  to  speed-up  logic  modifications,    a  few  discrepancies 
between  PE  simulations    and  the   actual  PE  have  been   found.      The  PE 
Simulator  will  be  modified  when  a  design  change  notice   is   available 
from  Burroughs.      Documentation  of  the  PE  Simulator  will  be   finished 
during  the  next  quarter. 

1.1.1.2  CU  Simulator  System 

With  the  exception  of  changes  to  the  simulators  themselves, 
the  CU  Card  Simulator  Generator  System  appears   to  have   stabilized.      It 
has   required  no  modifications   this   quarter. 

The   CU/PE  card  simulators  were  modified  to  be   compatible  with 
the   cross   reference   dictionary  program.      Various   small   changes  were 
also  made  to  make  the  execution  of  these  programs   less   troublesome. 

Programs    for  generating  the  partitioned  card  simulator  bodies 
for  the   CU  Section  Simulator  have  been  developed.      These  programs    are 
based  on  the   original  CU  card  simulator  generator  programs — with  some 
additional   features . 

The   individual   CU  cards    are    first  partitioned  into   "input" 
and   "output"  parts.      A  program  splits   the  netlist   into  two  parts,   taking 
partitioning  into  account.      Flip-flop  and  connector  packages    are  split 
into  two  parts.      Signal  sorting  and  package  sorting  are   done  next. 
Later,   leveling,   loop  detection,    and  ordering  the   levels    for  the  loops 
are   done.      Equation  generation  is   done  the  same  way   as    for  unpartitioned 
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cards ,   except   for  the  flip-flop  packages .      The  input  part   of  the  board 
can  have  either  both  the  input   and  output  parts   of  a  flip-flop  or  only 
the  input  part.      Hence   at   level   zero    (i.e.    for  output  part   of  the 
flip-flop)    a  table  is   prepared  containing  the  package   location  with  the 
relevant  pin  numbers   appearing  in  the  output  part.      At   level  100  this 
table  is    checked  and  the  equation  is   generated  accordingly,    depending 
on  whether  both  the  latches   in  the   flip-flop  package   are  used  or  only 
the   input   part  is  present,    or  both  the  input   and  output  parts   are  used. 
A  table  is   constructed  giving  the  package   location  and  the  array  sub- 
scripts   of  the  present   and  next  states.      During  the  output  part   equation 
generation,   this   table  is   consulted  and  the   appropriate  subscript   is 
picked  up   and  the  equation   for  the  output  part   of  the   flip-flop  is 
generated. 

Execution  of  all  the  programs   through  the  simulator  generator 
can  be   automatically   chained.      After  splitting  the  netlists   into  two 
parts,    the  normal   card  simulator  generator  sequence   of  programs    are 
executed  with  two  different  sets    of  data,   one   for  the   input  part   of  the 
board  and  the  other  for  the   output  part   of  the  board. 

1.1.2     Card  Test   Generation 

1.1.2.1     Card  Test  Generation  System 

The   failure   location  subsystem,    consisting  of  one  of  the  bad 
board  simulators    (LOCATE)    and  the   cross   reference   dictionary  programs 
(CUFAIL/XREF  and  PEFAIL/XBEF)   has  been  plagued  with  minor  bugs   this 
quarter.      Since  no  new  errors  have  been   found  in  more  than   a  month,  we 
are  assuming  the  programs    are   error- free   and  have   gone  into  production. 

Due  to  the  large   amount  of  continuous   machine  time   required  to 
execute  these  programs,   it  became   clear  that   a  great   deal  of  effort  was 
being  wasted  in  HALT/LOAD's,   particularly   in  the  case  of  CUFAIL/XREF. 
Therefore,    CUFAIL/XREF  has  been  modified  to  be  marginally  HALT/LOAD  proof. 
It   is   anticipated  that  LOCATE  will  be  HALT/LOAD  proofed  soon. 
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Various    changes  have  been  made  in  the  test  generation  program 
(CUTEST/RANDOM  or  CUTEST/CONTROL)   to   facilitate  its   use. 

The  major  weak  point  in  the   CU  Card  Test  Generation  System 
at  present  is   the  test  initialization  subsystem  which   is  being  delayed 
by  modifications   in  the   compiler  for  the  test  language    (TESLA).      These 
modifications    are  nearing  completion. 

1.1.2.2      Card  Test  Translation  System 

An   operational  test  translation  system  for  automatically 
generating  PEX  compiler  source   code   from  diagnostics   data  has  been 
completed   (PEXTRAN/PEXTAP)    and  tests    for  several  combinational   CU 
boards  have  been  generated  and  are   currently  being  executed  in  Paoli. 
Minor  modifications  have  been  implemented  as   problems    are  uncovered  in 
Paoli . 

Initialization  procedures    can  also  be  generated  for  boards 
containing   flip-flops    (PEXTRAN/INITRAN)   using  a  TESLA  file  as   input. 
Modifications    are  being  implemented  in  TESLA  temporarily  delaying  trans- 
lation of  non-combination  boards.      Ultimately,   to  write   a  board  initial- 
ization,  one  will   encode   a  TESLA  source  program  and  compile  it  with  the 
TESLA  compiler,  which  will   zip  the  simulator  itself,    zip  TESLA/LINEPRINT, 
and,    if  all   is  well,    zip   INITRAN.      Once   an  initialization  has  been 
created  for  each  test  pattern    (FAWP)    file,   the  main  translation  program 
can  be  executed,    zipping  the  PEX  compiler. 

Since  the  size  of  the  PEXTAP  source  programs    can  go  as  high 
as   8000   card  images  with   as   many  as   1000  procedure   calls  ,   and  since 
the  PEX  compiler  slows   down  dramatically  with  the   addition  of  procedure 
calls,    a  new,    "smarter"  version  of  the  PEXTRAN  series   is  under  develop- 
ment, which  will   include  initializations   as   in-line  code   creating 
somewhat   longer  printouts,  but,  hopefully,   reducing  computer  time 
substantially.      A  new  version  of  the  PEX  compiler  is  presently  being 
debugged  in  Paoli  with   expectations    of  increased  efficiency. 
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1.1.2.3     PE  Card  Test  Generation 

All  PE  card  types  whose  logic  diagrams  had  been  released  to 
the  University  prior  to  June  15th  have  been  completely  processed  through 
the   diagnostics   generator  system.      Test   dictionaries   and  TI   56IA  tester 
codes    for  kO  unrevised  PE  and  MLU  card  types  have  been  generated  and 
sent  to  Burroughs   for  use  in  PE  card  production  testing.      Diagnostics 
must  still  be   generated  for  17   card  types    (six  new  card  types   and  eleven 
revised  cards).      Logic  prints    for  some  of  these  were  received  during  the 
last   few  days   of  the  quarter,   logics    for  the   others  have  not  yet  been 
forwarded. 

1.1.2. U      CU  Card  Test   Generation 

Of  the  sixty-six  CU  card  types  we  have  received  from  Burroughs: 
failure   detection  has  been  finished  on  29   and  is   in  progress   on  7, 
failure  location   is   finished  on  25   and  is   in  progress    on  k,    and  test 
translation  is    completed  on  7- 

The   attempt  is  being  made  to   account   for  all  possible  single 
failures  which  will   affect  the  operation  of  the  board.      The   few  failures 
not  tested  are  those  masked  by  logic  redundancy   or  those  which  cannot  be 
simulated  properly   due   to  pragmatic  limitations   in  the  simulator  system. 

The   completion  of  test  translation  on  new  boards   and  the 
arrival  of  3  part   output  paper  is   necessary  before   sending  more  material 
to  Burroughs.      The   successful  operation  of  the  Burroughs    card  tester  is 
also  awaited. 

1.1.2.5      Failure  Analysis 

During  this   quarter,    final  memos   describing  circuit  failures 
in  the  dual-in-line  packages  were   received  from  Burroughs .      The   circuit 
failures  were   compared  with  the  logic   failure  modes  being  used  in  the 

-710s tic  test  generation  system.      The   logic   failure  modes  were  found  to 
be  correct  and  complete  under  the   assumption  that  only  single  stuck-type 
lures   occur. 
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1.1.3     PE  Diagnostics 

1.1.3.1  Path  Tests 

The  detection  and  location  phase  path  tests  were  maintained 
in  this  period.   They  will  be  used  for  production  testing  of  the  PE's, 

1.1.3.2  Combinational  Tests 


Generation  of  combinational  tests  has  been  completed.   There 
are  35  tests  covering  the  address  adder  (ADA) ,  barrel  switch  (BSW) , 
carry  propagate  adder  (CPA),  carry  save  adder  (CSA),  multiplier  decoder 
gates  (MDG) ,  and  multiplicand  select  gates  (MSG).   All  tests  except 
those  for  the  BSW  have  been  verified  by  the  PE  simulator. 

1.1.3-3  Control  Logic  Tests 

The  main  efforts  in  this  quarter  were  to  update  and  debug  the 
test  generation  programs  for  the  control  logic  and  the  design  of  micro- 
sequences  for  the  initialization  of  the  tests.   Initialization  for  the 
control  logic  test  has  been  divided  into  several  steps  .   The  PEX 
assembly  language  procedures  to  initialize  the  control  latches  have 
been  written. 

1.1.3.U  Functional  Tests 


Except  for  a  few  tests,  verification  of  the  Burroughs  func- 
tional tests  using  the  PE  Simulator  has  been  completed.   Some  of  the 
tests  should  be  modified  to  include  initialization  routines.   Evaluation 
of  the  tests  is  continuing. 
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2.   SOFTWARE 


2.1  Operating  System 

2.1.1  Operating  System  I 

Current  work  on  Operating  System  I  may  be  divided  into  the 
following  areas : 

•  The  Job  Parser:   a  module  which  translates  the  control 

language,  ICL  (ILLIAC  Control  Language),  into  a 
manageable  form; 

•  B6500  modules  outside  the  MCP  (Master  Control  Program, 

the  B65OO  operating  system) ; 

•  Hardware  supervisor:   modifications  to  the  MCP  to 

incorporate  ILLIAC  into  the  system; 

•  Buffer  files:   an  extension  of  the  MCP  which  provides 

for  extensive  intermodule  communication  necessary  to 
support  the  design  principles  of  the  operating  system; 

•  ILLIAC-centered  modules,  the  Job  Partner  and  0S4  (the 

only  operating  system  module  which  runs  on  ILLIAC  IV). 

Progress  in  each  of  the  aforementioned  areas  may  be  related  as: 

•  The  Job  Parser  is  90  per  cent  coded  and  the  existing 

code  (on  the  B5500)  is  debugged  to  the  extent  that 
test  cases  are  processed  correctly; 

•  The  modules  of  the  system  which  are  exterior  to  the  MCP 

are  95  per  cent  coded.   Integration  of  these  modules 
and  the  Job  Parser  began  at  the  end  of  the  quarter; 

•  Debugging  of  the  long  contemplated  hardware  supervisor 

began  at  last,  with  the  availability  of  a  limping  B65OO 
in  Paoli.   Progress  here  will  accelerate  as  more 
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reliable  performance  is  achieved  by  the  B6500  hardware 
and  software; 

•  Buffer  files  have  "been  implemented  for  the  B5500  system 

and  have  been  working  well  for  several  months .  Work 
on  the  B65OO  version  will  be  done  in  Pasadena  with  the 
cooperation  of  the  Burroughs  software  development 
activities  there; 

•  The  Job  Partner  and  OS^  are  90  per  cent  coded  and  are 

undergoing  debugging.   Macros  are  being  developed  for 
use  in  conjunction  with  0Sl+  by  assembly  language  (ASK) 
users  (which,  of  course,  includes  GLYPNIR  users). 

Currently,  documentation  of  the  operating  system,  from  both 
user  and  maintenance  viewpoints,  is  the  chief  weak  point.   More  effort 
will  be  expended  on  these  items  in  the  next  quarter. 

2.1.2  Operating  System  II 

All  work  on  Operating  System  II  (OSIl)  has  been  suspended. 
Those  who  were  working  on  OSII  are  now  concentrating  on  Interactive 
Communications . 

2.2  Compilers  and  Translators 

2.2.1  GLYPNIR 

Specification  of  I/O  statements  is  still  undergoing  change. 
Implementation  of  I/O  for  the  simulator  is  under  way.   A  number  of 
routines  have  been  implemented:   including  routines  for  shifting, 
rotating,  and  routing. 

The  output  of  the  compiler  has  been  modified  to  be  compatible 
with  the  new  assembler  simulator.   Several  restrictions  in  the  lan- 
guage have  now  been  relaxed:   including  indexing  and  several  control 
statements . 
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A  number  of  compiler  routines  have  been  rewritten  to  make 
training  of  new  personnel  easier.   An  XALGOL  version  of  the  compiler  has 
also  "been  debugged. 

Evaluation  of  the  new  TWST  system  is  in  progress.  Work  on  the 
GLYPNIR  Macro  Generation  is  still  progressing. 

2.2.2  FORTRAN 

During  the  quarter,  the  design  of  an  extended  FORTRAN-like 
language  for  ILLIAC  IV  was  completed.   The  language  has  been  named 
COCKROACH.   Complete  language  specifications  have  been  documented  and 
carefully  reviewed  by  an  internal  project  review  committee.   A  broad 
sampling  of  potential  user  opinion  on  the  language  will  be  gathered  this 
Fall. 

In  addition  to  the  language  specifications ,  ILLIAC  IV  algorithm 
studies  were  also  completed  during  the  quarter.   All  major  (parallel  and 
sequential)  features  of  the  language  have  been  carefully  analyzed,  code 
written  for  their  implementation,  and  timing  evaluations  performed  to 
select  the  most  efficient  algorithms. 

Liaison  with  Burroughs  concerning  compiler  implementation  has 
continued  during  the  quarter.   Alternate  plans  for  compiler  implementation 
are  also  being  studied  in  detail. 

2. 3  Assembler 

The  Assembler  has  now  been  coded  in  ALLGOL  (a  special  version 
of  XALGOL  which  also  allows  stream  procedures)  and  hence  compiles  ASK 
programs  at  about  twice  its  previous  speed.   The  Assembler  now  allows 
working  "conditional"  constructs . 

The  Macrogenerator  is  in  the  final  stages  of  being  debugged. 

2.1*  Interactive  Communications  and  Graphics 

Due  to  the  revised  plans  for  site  selection  for  the  ILLIAC  IV 
,  the  plans  for  computing  facilities  for  the  Center  have  been 


■10- 


reduced,  for  the  present,  to  a  flexible  and  sophisticated  terminal  to 
the  ARPA  network. 

Initial  planning  evolves  around  the  following  configuration — 
a  PDP-11  System  with: 

l6K  word  memory; 

Card  reader  (200  CPM) ; 

Line  printer  (300  LPM) ; 

Disk  storage  (256K  words); 

DEC  tapes  (2  drives); 

2  storage  scope  terminals; 

12  CRT  terminals  (2^00  Baud); 

k   TTY  terminals ; 

Gould  U800  electrostatic  printer; 

High-speed  paper  tape  system. 

The  final  system  design  will  be  completed  during  the  next  quarter  and 
equipment  ordered. 

The  acquisition  of  a  DEC  PDP-10/ll  as  the  front  end  computer 
for  the  ILLIAC  IV  system  was  suspended  pending  further  discussion  on 
the  final  installation  site. 

Bids  were  sent  and  received  for  a  high-quality  microfilm 
recording  system  and  two  interactive  graphics  systems — a  minimal  system 
and  an  augmented  system.   Due  to  a  sharp  cutback  in  funding,  it  appears 
that  neither  system  will  be  acquired.   Thus,  further  action  has  been 
suspended,  pending  further  discussions  with  ARPA. 

2.5  B5500  Operation 


April 

May 

June 


Wo.  of 

Process 

Jobs 

Hours 

263UT5 

21*8.02 

17,607 

122.81 

19,283 

195.50 
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Use  of  the  machine  has  decreased  slightly  this  quarter.  A 
hardware  failure  brought  the  system  down  for  a  week  in  mid-May  which 
explains   the  drop  in  May's  totals. 

Batch  processing  is   offered  during  the  day  now,   and  operators 
are  on  duty  2i+  hours   a  day,    7  days   a  week. 
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APPLICATIONS 


3.1  Numerical  Analysis 

3.1.3     Numerical  Solution  of  Problems   in  Hydrodynamics 

In  the  numerical   solution  of  mult i -dimensional   fluid  dynamic 
problems,   the   difference   formulation  of  the  governing  quasi-linear 
partial  differential  equations   using  various    finite  difference  schemes 
should  be  studied  systematically  and  carefully  in  determining  the 
reliability  and  accuracy  of  the  numerical  solution.      In  keeping  with 
this   philosophy,   numerical  experimentation  with  Burger's   equation, 
which  has   the  same  vector  form  as   the   full  Navier-Stokes   equations, 
is   at  present  under  way.      The   following  finite  difference  schemes   are 
being  studied: 

•  Brailovskaya  scheme; 

•  Dufort-Frankel  scheme; 

•  Cheng- Allen  scheme; 

•  Crank-Nicolson  implicit  scheme. 

A  code  has  been  constructed  incorporating  these  finite  differ- 
ence schemes  which  provides  valuable  information  regarding  their  dif- 
ferent computational  characteristics  in  various  types  of  hydrodynamic 
problems.   A  graphical  display  code  has  been  interfaced  with  this  code; 
whereby,  it  is  possible  to  display  the  results  of  the  calculations  on 
the  ocsilloscope  and  obtain  direct  photographic  records. 

Tests  at  different  Reynolds  numbers  and  time  increments  have 
been  made;  the  steady  state  solution  obtained  as  the  asymptotic  long- 
time solution  of  the  non-steady  formulation  has  been  compared  with  the 
exact  steady  state  solution.   It  was  found,  for  instance,  that  for  a 
fixed  Reynolds  number,  the  accuracy  of  the  Brailovskaya  scheme  increases 
markedly  with  the  increase  in  the  value  of  the  time  increment.   The 
Crank-Nicolson  scheme  also  exhibits  an  increase  in  accuracy,  but  to  a 
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lesser  extent;   whereas,   the  Dufort-Frankel  scheme  shows    a  decrease  in 
accuracy  with   an  increase  in  the  time   step.      Contrary  to  the   accepted 
belief  that  the  discontinuity  thickness   reduces  to  a  smaller  number  of 
space  meshes  with  higher  Reynolds   numbers ,    it  is  seen   from  the 
calculations — for  instance,  with  the  Brailovskaya  scheme — that  the 
discontinuity  thickness   is   smaller  at   low  Reynolds   numbers   than  at   large 
Reynolds   numbers,  because   of  the   greater  computational  accuracy  at   lower 
Reynolds   numbers.      The  Brailovskaya  scheme,  however,  has    a  more  stringent 
limitation  on  the  time  step   from  stability   considerations   than  either 
the   Cheng-Allen  or  Dufort-Frankel  scheme,   as   the  stability  limit  depends 
explicitly   on  the   Reynolds   number.      This    could  be   a  limitation  in  using 
this   scheme   for  calculating  low  Reynolds   number  flows.      However,  with 
large   computers   like  the   ILLIAC  IV,  where   computational  time   is   not  the 
major   factor,   the   increased  accuracy  of  the  Brailovskaya  scheme  presents 
an  important   advantage. 

Two  different   codes,    one  incorporating  conservation  boundary 
conditions    and  the  other  using  reflection  boundary   conditions,  have  been 
constructed,   so  it   is  possible  to   study  the  effect   of  different   difference 
formulations   of  the  same  boundary   conditions.      The   influence  of  different 
physical  boundary   conditions   is   also  being  tested. 

In  order  to   study  the  suitability  of  the   finite  difference 
schemes   in  inviscid  flow  applications,    another  code  has  been  constructed 
for  the   inviscid  form  of  the  Burger's   equation,   viz, 

du       ur^u        _ 

Tests  with  this   code  have   shown,    for  instance,   that  the   centered-time, 
centered-space  Dufort-Frankel   representation  of  the  inviscid  equation  is 
unconditionally  unstable. 

These  and  other  results  will  be   forthcoming  in  the   form  of  a 
formal  report. 
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3.1.2  Matrix  Inversion  and  Solution  of  Linear  Algebraic  Equations 

The  method  of  Householder  triangularization  [1]  for  solving 
real  linear  algebraic  equations  or  for  inverting  a  real  matrix  and  the 
method  of  modification  [2]  for  inverting  a  matrix  (matrix  inversion  by 
rank  annihilation)  have  been  coded  in  ALGOL.   Both  algorithms  give 
accurate  results  for  well  conditioned  matrices. 

The  Householder  triangularization  algorithm  consists  of 
reducing  the  original  matrix,  A,  to  an  upper  triangular  one  in  (n  -  l) 
steps,  where,  n  is  the  order  of  the  matrix,  A,  the  rth  of  which  can  be 
described  by 


A  =  P  A  _ 
r    r  r-1 


r  =  1,  2,  ...,  n  -  1} 


(1) 


where,  A  =  A  is  the  original  matrix,  P  is  an  elementary,  orthogonal 
Hermitian  matrix,  given  by 

t 


P  =  I 
r 


u  u 

r  r 

2 
2KI 


(2) 


in  which  the  vector,  u  ,  has  its  first  (r  -  l)  elements  equal  to  zero 


u  =  (0,  ...,  0,  a    -f-  S  ,  a  _   ,  a  |0   ,  ...,  a   );   (3) 
r    v  r,r—  r'      r  -t-1 ,  r   r  +2 ,  r        n,r7     ' 


S  = 
r 


"  n     11/2 

y  2 

La.    i 

ri 
i=r 


(h) 


2K  =  S  (S   -  a   ); 
r    r  r  —  r,r 


(5) 


and  the  sign  in  (3)  and  (5)  is  chosen  such  that: 


a    +  S 
r,r  —  r 


r  ,r 


+  S  . 


(6) 
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Therefore,  if  AX  =  B  is  the  matrix  equation  to  be  solved,  where  A  is  an 
n  x  n  matrix  and  B  is  an  n  x  m  matrix,  then 

RX  =  C;  (7) 

where,  R=Pn_iPn_2  "'    *T^  ^"S  an  uPPer  triangular  matrix  and  C  =  P  -,  P n_o 
.  .  .  P-iB.   The  n  x  m  matrix,  X,  can  be  obtained  by  backsubstitution. 

A  scheme  for  the  iterative  improvement  of  the  resulting 
solution  has  also  been  programmed  in  ALGOL.   However,  for  very  ill- 
conditioned  matrices  the  whole  process  should  be  programmed  in  double 
precision  arithmetic. 

The  method  of  modification  can  be  described  by  the  relations: 
\  =  \-l  +  Ukek     (k  =  1}   2>      ">   n)'  (8) 


<\-i  +  Vk)_1  =  \-l  "  n  vi    <\-l  VkAk-l^     (9) 

lfeA-lUk 

where  A_  =  I  is  the  identity  matrix  and  A  =  A  is  the  original  matrix, 
0  n 

eJ:  being  the  kth  unit  vector,  and 

uk  =  (A  -  AR_1)  ek.  (10) 

Thus,  by  the  end  of  the  nth  step,  the  inverse  of  matrix  A  is  obtained. 
Although  this  method  fails  if  the  matrix  has  singular  principal  minors, 
it  appears  to  be  quite  practical  for  use  on  ILLIAC  TV.  Most  of  the 
computations  can  be  done  in  parallel  with  great  efficiency.  Moreover, 
the  method  is  highly  flexible  for  matrices  that  are  not  core  contained. 
This  method  has  been  programmed  in  ILLIAC  IV  assembly  language  (ASK). 
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3.1.3  Eigenvalue  Problems 


3.1.3.1  Jacob! ' s  Algorithm 

This  quarter  the  parallel  method  of  Jacobi  for  real  symmetric 
matrices  has  been  modified  again  so  as  to  speed  up  convergence.  For 
example,  for  matrices  of  order  n  =  2  ,  where  a  is  an  integer,  each 
orthogonal  transformation  eliminates  n  different  off -diagonal  elements, 
i.e.,  after  (n  -  l)  orthogonal  transformations  each  of  the  off-diagonal 
elements  would  have  been  annihilated  once. 


3.1.3.2  A  Jacobi-Like  Algorithm  for  Nonsymmetric  Matrices 

This  algorithm,  originally  developed  by  Eberlein  [3]>  nas 
been  modified  for  a  parallel  computer  to  obtain  the  eigenvalues  and 
eigenvectors  of  diagonalizable,  real  nonsymmetric  matrices.   This 
method  essentially  minimizes  the  norm  of  the  matrix  by  a  series  of 
similarity  transformations  until  the  matrix  is  arbitrarily  normal, 
i.e.,  A  .A*  -  AnAp  =  0.   Once  the  matrix  is  reduced  to  a  normal  one, 
the  convergent  optimal  procedure  of  Goldstine  and  Horwitz  [k]   would  be 
used  to  reduce  it  to  the  diagonal  form,  thus  yielding  the  eigenvalues. 
If  the  transformation  matrices  are  saved,  their  product  would  yield  the 
matrix  of  the  corresponding  eigenvectors. 

3.2  Linear  Programming 

Linear  Programming  (LP)  efforts  this  quarter  have  largely 
been  devoted  to  SSK  simulation  of  segments  of  the  Linear  Programming 
System  (LPS).   The  LP  data  preprocessor  described  last  quarter  was 
coded  this  quarter  and  is  in  the  last  stages  of  debugging  on  the 
simulator. 

In  order  to  reasonably  simulate  i/O  with  standard  coding,  a 
significant  amount  of  manpower  was  devoted  to  the  development  of  ASK 
i/O  macros  for  the  LP  programs.   These  macros  handle  file  descriptions, 
buffer  allocation,  buffer- read  assignments,  and  read,  write,  and 
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"advise"  statements.   They  have  been  coded  and  partially  simulator 
debugged  and  are  available  to  other  programmers  on  request. 

3.3  Long  Codes 

The  newly  completed  programs  to  model  the  behavior  of  a 
linear  system  and  to  identify  the  parameters  of  the  system  from  the 
outputs  generated  by  the  model  have  been  used  to  study  alternatives  to 
the  vector  extension  of  R.  C.  K.  Lee's  scalar  identification  algorithm. 
The  identification  programs  were  modified  to  provide  a  plot  of  the  norm 
of  the  error  vector,  i.e.,  the  difference  between  the  true  value  of  the 
transition  vector,  \|r,  and  the  current  estimate  of  \|/. 

The  resulting  plots  indicate  that  the  estimates  produced  by 
all  of  the  identification  programs  decrease  in  accuracy  for  a  short 
time  following  the  initial  estimate  before  they  begin  converging  toward 
the  true  value  of  \Jr.   One  of  the  more  promising  identification  schemes, 
an  algorithm  that  uses  summations  of  observation  vectors,  showed  an 
initial  gain  over  the  accuracy  of  the  extended  Lee  algorithm,  but  the 
summation  algorithm  and  its  variants  have  shown  poor  behavior  in 
comparison  to  the  extended  Lee  algorithm  after  a  large  number  of 
observations,  apparently  because  of  accumulated  round-off  errors. 

Experiments  have  been  designed  to  identify  the  cause  of  the 
degradation  and  to  discover  a  method  of  controlling  it. 

3.4  Large -Scale   Planning 

During  the  past  quarter,  the  model  mentioned  in  the  last  QPR 
has  been  conceptually  defined,  and  work  is  proceeding  on  the  detailed 
specification  of  its  components. 

Work  on  the  general  matrix  generator  system  has  been  con- 
tinuing, and  the  philosophy  of  this  system  has  been  defined.   The 
generator  will  take  advantage  of  the  geometric  structures  encountered 
LP  matrices,  and  will  enable  the  user  to  define  sub-arrays  of  the 
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model.   It  is  hoped  that  the  generator  system  will  be  suitable  for  use 
via  a  CRT  interface  with  the  ILLIAC  IV. 

Since  the  model-specific  generator  depends  upon  the  existence 
of  the  model,  no  progress  has  been  made  in  this  area. 

3*  5  Signal  Processing 

3«5«1  Normal  Moveout  Correction 

A  Compatible  ALGOL  program  for  Normal  Moveout  Correction 
(NMO)  has  been  written  and  debugged  and  is  available  for  use  on  the 
B5500  or  B65OO  computer.   The  program  uses  a  mapping  algorithm  to 
transform  non-normal  ray  travel  paths  to  a  normal  incidence  travel 
path.   The  algorithm  uses  the  At  correction  as  an  address  computation, 
and  to  preclude  square-root  calculations,  uses  a  specialized  iteration 
technique  for  address  calculation  which  converges  to  the  correct  value 
in  two  to  three  iterations. 

3.5«2  Stacking 

A  "stacking"  or  "summing"  program  has  been  written  in  ASK  to 
utilize  the  output  from  the  NMO  program  to  sum  several  records.   The 
input  signals  are  often  of  different  lengths,  with  different  starting 
and  ending  locations.   The  program  output  is  the  sum  of  the  input 
signals  normalized  by  dividing  by  the  number  of  signals  used  to  obtain 
the  sum.   The  program  uses  a  unique  algorithm  which  utilizes  negative 
zeros  at  the  leading  and  trailing  locations  of  the  input  vectors  to 
calculate  the  normalization  factor.   This  technique  permits  the  stack- 
ing of  any  length  records  on  ILLIAC  IV  and  calculating  the  proper 
normalization  factor  regardless  of  the  mute  location. 

3. 5*3  Bandpass  Filter 

An  ALGOL  program  has  been  written  which  computes  the  impulse 
responses  of  a  set  of  bandpass  filters.   These  filters  may  or  may  not 
be  combined,  as  the  user  chooses.   Three  types  of  graphical  outputs  are 
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printed,  as  follows:   (l)  the  zero-phase  impulse  response  of  the  filter 
which  passes  selected  frequencies  and  with  convergence  properties  that 
are  improved  by  using  Harming  weights;  (2)  the  percentage  of  the  total 
power  corresponding  to  any  given  length  of  the  filter;  and  (3)  the 
Fourier  transform  of  the  impulse  response.   Programs  have  "been  written 
using  both  conventional  and  fast  Fourier  transforms—their  timings  will 
he  compared  for  various  filter  lengths. 

3.5A  Weiner  Shaping  Filter  [5] 

Programs  for  the  calculation  of  shaping  filter  coefficients 
using  a  recursive  solution  of  the  simultaneous  equations  involving  an 
autocorrelation  matrix  have  been  written  in  compatible  ALGOL  and  ASK. 
The  technique  is  described  as  follows  — 


[f0>  fl'  ' 


,  f. 


m  ■ 


R, 


*1 
R0 


.  R 


m 


R    R  ,    R_ 
m    m-1    0 


=  [ 


>0'  *!> 


Given  the  autocorrelation  coefficients  [R_,  R-,  ,  ...,  R  1  and  the  right 

L  0   -L       mJ 

hand  side  coefficients  [g  ,  g  ,  ...,    e  ],  the  problem  is  to  solve  for 

[f^,  fn.  ....  f  1,  the  filter  coefficients. 

1    0'   1'     '   m-" 

By  taking  advantage  of  the  Toeplitz  structure  of  an  auto- 
correlation matrix,  the  filter  coefficients  may  be  solved  without 
inverting  the  autocorrelation  matrix. 

Steps  for  the  computations  are : 

1)   Set  up  the  initial  conditions, 


ao,o  "  1'    %  z   V  po  =  V  fo,o 


l0/r0>      7n  '"   f 


^o,ori' 


■  ;■(>. 


2)     k    =  -3  /a  } 


3)      an+l,0  ~  an,0 


n+1,1         n,l         n     n,n 


a.,        =  a  +  k     a     _ , 

L  n+l,n         n,n         n     n,l 


a,         -,  -,       —     K  a,  _ 

n+1, n+1         n     n,0 


a. .  when  i  =  j: 


a. .  when  i  ^  0,    j   =  0: 


a.  .  when  i  =  j: 
ij 


h)     a     n   =  a     +  k     6   : 

n+1         n         n     n 


5)     B,n=a     -,    ~  R     o  +  a     ,    ,   E     -    +  . . .    4-  a     ,        n    R-,  ; 
■"      Kn+1         n -1-1,0     n+2         n+1,1     n+1  n+1, n+1     ±' 


6)      q_   =    (g     ,-7   )/0!     ,  ; 

y      ^n        v&n+l         n"    n+1' 


T)     fn+l,0  "  fn,0  +  Si  an+l,ntl' 

f    ->  -,  =  f    -,  +•  q_    a    -, 

n+1,1         n,l        Ti     n+l,n 

wn+l,n         n,n        ti     n+1,1 

fn+l,n+l  =  Si  an+l,0' 


f .  .  when  i  4  0,    j   =  0: 


f .  .  when  i  4-  j; 


f . .  when  i  =  j: 


8)     7     _    =  f     ,    ~  R     0+f     ,   ,   R     .+...+  f     .        _   R_  . 

'      'n+1         n+1,0     n+2         n+1,1     n+1  n+1, n+1     1 

Following  the  establishment  of  the   initial  conditions,   the 

recursions   are  done  for  n=0ton=m-l  for  steps    (2)   to    (8).      The 

final  values  obtained  for  the  filter  coefficients    (namely,    f     _,    f     .  . 

v     J '     m,0   m,l 

...,  f   )  represent  the  desired  solution,  f~,  f ,  ,  ..•,  f  • 
m,nr    *  '01m 


3-5-5  Correlation  Analysis 

The  correlation  analysis  programs  discussed  in  the  last  QFR 
are  available  in  an  ILLLAC  IV  document  [6].   The  programs  have  been 
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assembled  as  object  modules  for  a  subroutine  library.   The  user  needs 
only  to  assemble  a  driver  program  (consisting  of  data)  which  calls  the 
assembled  subroutines. 

Work  has  begun  on  two  dimensional  correlation  programs.  An 
ASK  program  has  been  coded  to  compute  A(Z)  B(Z)  =  C(Z)  where  A,  B,  and  C 
are  matrices  and  C(Z)  has  square  dimensions  less  than  6k.      This  program 
has  been  debugged  and  simulated  on  the  ILLIAC  IV  B5500  simulator.  Work 
is  progressing  to  extend  this  program  to  handle  matrices  of  dimensions 
larger  than  6k. 

3-6  ILLIAC  IV  Education 

3.6.1  Documentation 

ILLIAC  IV  Research  Document  Abstracts,  a  publication  of 
ILLIAC  IV  Education,  is  being  distributed  to  all  FOURUM  subscribers  with 
the  July,  1970  issue  of  the  FOURUM  (FOURUM  No.  lk) .      This  document  will 
facilitate  user  acquisition  of  pertinent,  project-generated  research 
documents. 
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4.   ADMINISTRATION 


4.1  Administration  and  Services 

Budget  expenditures  for  fiscal  year  1970  are  as  follows: 

Burroughs  Corporation     $6,711,392.00 
University  of  Illinois    $2,197,646.65 

This  resulted  in  a  total  project  cost  through  fiscal  year  1970  as 
follows : 

Burroughs  Corporation    $22,516,924.00 
University  of  Illinois   $  5,203,908.51 

The  ILLIAC  IV  project  has  a  personnel  strength  of  133  people : 

Professional  39 

Non- Academic  22 

Research  Assistants  34 

Hourlies  31 

Illini  Girls  (Secretarial 

Assistants)  7 

133 

The  Center  for  Advanced  Computation  will  be  organized  as 
shown  under  4.2  (next  page). 

Precision  Instruments  Company  has  "been  contracted  for  the 
purchase  of  a  laser  storage  system.   The  system  will  have  a  storage 

-1  p  r 

capacity  of  10   bits  with  data  rates  of  4  x  10  bits  per  second  per 
channel.   Delivery  is  expected  to  be  in  15  months.   The  University  will 
be  doing  the  software  interface  to  the  B650O  while  Precision  Instruments 
will  be  doing  the  internal  software  for  the  laser  memory  as  well  as  the 
hardware  interface.   The  June  20  issue  of  Business  Week  carried  a  brief 
article  regarding  the  laser  memory. 
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k.2     Organization  Chart 


CENTER  FOR  ADVANCED  COMPUTATION 

Professor  D.  L.  Slotnick,  Director 

ADMINISTRATION 

R.  F.  Borelli 

APPLICATIONS 

SYSTEMS  SERVICES 

SOFTWARE 

D.  E.  Mclntyre 

G.  A.  Westlund 

J.  M.  Randal 

Meteorology 

Operations 

Operating  System 

Numerical  Algorithms 

Hardware  Support 

Interactive 

Communications 

and  Graphics 

Linear  Programming 

Education 

and 
Information  Retrieval 

Applications  Software 

FORTRAN 

Seismic 

Ecology 

ask/ssk 

Large  Scale  Planning 

GLYPNIR 

Special  Projects 
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